PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG042062t4
Common NameTCM_042062
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 737aa    MW: 80201.5 Da    PI: 7.3775
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG042062t4genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.42.4e-182683457
                      -SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
          Homeobox  4 RttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                        ++t+eq+e+Le++++++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Thecc1EG042062t4 26 YVRYTAEQVEALERVYAECPKPSSLRRQQLIRECpilsNIEPKQIKVWFQNRRCREKQ 83
                      5789****************************************************97 PP

2START159.72.3e-501653722204
                       HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEE CS
             START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galq 92 
                       +aee+++e+++ka+ ++  Wv+++ +++g++++ +f+ s+++sg a+ra+g+v  +++   +e+l+d++ W ++++  e+      g  g+++
  Thecc1EG042062t4 165 IAEETLAEFLSKATGTAVDWVQMPGMKPGPDSVGIFAISQSCSGVAARACGLVSLEPT-KIAEILKDRPSWFRDCRNLEVFTMFPAGngGTIE 256
                       7899******************************************************.7777777777*******9999999998888**** PP

                       EEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHH CS
             START  93 lmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwll 181
                       l +++++a+++l+p Rdf+++Ry+ +l+ g++v++++S++     p+    +++vRae+lpSg+li+p+++g+s +++v+h++l++++++++l
  Thecc1EG042062t4 257 LVYTQTYAPTTLAPaRDFWTLRYTTTLENGSLVVCERSLSGSGAGPSaaaAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHMNLEAWSVPEVL 349
                       ****************************************99999988999****************************************** PP

                       HHHHHHHHHHHHHHHHHHTXXXX CS
             START 182 rslvksglaegaktwvatlqrqc 204
                       r+l++s+ + ++k++ a+l++ +
  Thecc1EG042062t4 350 RPLYESSKVIAQKMTIAALRYIR 372
                       ******************99865 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007115.1432084IPR001356Homeobox domain
SMARTSM003893.1E-152288IPR001356Homeobox domain
SuperFamilySSF466892.44E-162487IPR009057Homeodomain-like
CDDcd000863.83E-162585No hitNo description
PfamPF000466.3E-162683IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.602.5E-182783IPR009057Homeodomain-like
CDDcd146862.10E-677116No hitNo description
PROSITE profilePS5084827.25155383IPR002913START domain
CDDcd088756.94E-68159375No hitNo description
SuperFamilySSF559613.3E-35164376No hitNo description
Gene3DG3DSA:3.30.530.201.3E-20164368IPR023393START-like domain
SMARTSM002341.8E-40164374IPR002913START domain
PfamPF018527.0E-48165372IPR002913START domain
PfamPF086701.1E-49592735IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009855Biological Processdetermination of bilateral symmetry
GO:0009944Biological Processpolarity specification of adaxial/abaxial axis
GO:0009956Biological Processradial pattern formation
GO:0010014Biological Processmeristem initiation
GO:0010051Biological Processxylem and phloem pattern formation
GO:0010089Biological Processxylem development
GO:0030154Biological Processcell differentiation
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 737 aa     Download sequence    Send to blast
MAMAVAQHRE SSSGSSINKH LDAGKYVRYT AEQVEALERV YAECPKPSSL RRQQLIRECP  60
ILSNIEPKQI KVWFQNRRCR EKQRKEASRL QTVNRKLTAM NKLLMEENDR LQKQVSQLVC  120
ENGYMRQQLH TVNASAATDA SCDSVVTTPQ HSLRDANNPA GLLSIAEETL AEFLSKATGT  180
AVDWVQMPGM KPGPDSVGIF AISQSCSGVA ARACGLVSLE PTKIAEILKD RPSWFRDCRN  240
LEVFTMFPAG NGGTIELVYT QTYAPTTLAP ARDFWTLRYT TTLENGSLVV CERSLSGSGA  300
GPSAAAAAQF VRAEMLPSGY LIRPCEGGGS IIHIVDHMNL EAWSVPEVLR PLYESSKVIA  360
QKMTIAALRY IRQIAQETSG EVVYGLGRQP AVLRTFSQRL SRGFNDAING FNDDGWSIMN  420
CDGAEDVIIA INSSKNLSSS SNPANALSFL GGVLCAKASM LLQNVPPAVL VRFLREHRSE  480
WADFNVDAYS AASLKAGTYS YPGMRPTSLE VGTATNHAAG DAPSCQNSRS VLTIALQFPF  540
DSNLQDNVAA MARQYVRSVI ASVQRVAMAI SPSGLSPTVG PKLSPGSPEA LTLAHWICQS  600
YSYHLGAELL RAESLGGDAV LKNLWQHQDA ILCCSLKSLP VFIFANQAGL DMLETTLVAL  660
QDITLDKIFD ESGRKALCSD FAKLMQQGFA YLPAGICMST MGRNVSYEQA VAWKVLAADE  720
STVHCLAFSF VNWSFV*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY9664460.0AY966446.1 Gossypium barbadense class III HD-zip protein mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007016752.10.0Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 4
SwissprotQ9SE430.0REV_ARATH; Homeobox-leucine zipper protein REVOLUTA
TrEMBLA0A061GY240.0A0A061GY24_THECC; Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 4
STRINGGLYMA11G20520.10.0(Glycine max)
STRINGGLYMA12G08080.10.0(Glycine max)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G60690.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]